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© Hepatitis delta diagnostics and vaccines, their preparation and use. 



© The entire genome of the hepatitis D virus has been shown to be a circular single-stranded RNA of 1679 
bases. Several open reading frames in both the genomic and complementary strands indicate possible protein 
products. The products encoded in one open reading frame, ORF5. are identified as viral polypeptides p24 5 and 
p27* , of which the nuclear 5 antigens in HDV infected liver is comprised. These products, as well as others 

^encoded in ORFs 1 t 2, 6, and 7 are produced in recombinant expression systems. The ORF5 products, in 

^particular, are useful for HDV diagnosis and vaccines. 
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HEPATITIS 6 DIAGNOSTICS AND VACCINES, THEIR PREPARATION AND USE 

The invention relates to materials and methodologies for managing the spread of hepatitis 5 infection. 
More specifically, it relates to diagnostic DNA fragments, diagnostic proteins, and protective antigens and 
antibodies with respect to hepatitis 5 virus. 

An unusual form of hepatitis virus, hepatitis D (HDV), also called 5 agent, was discovered in 1977 by 

5 Rizzetto, M., et al. Guy (1977) 18:997-1003. The virus was detected as a new antigen/antibody system by 
immunofluorescence in liver cells of patients infected with hepatitis B. Indeed, subsequent investigation 
showed that hepatitis D virus is dependent upon concomitant infection with hepatitis B in order to replicate. 
The nature of the helper function is not as yet understood. However, the HDV apparently contains a single- 
stranded RNA genome surrounded by a "5 antigen" protein, which is in turn surrounded by hepatitis B 

10 surface antigen (HBsAg) in a 35-37 nm particulate configuration (Rizzetto. M. et al, Proc Natl Acad Sci USA 
(1980) 77:6124-6128: Bonino, F., et al, Hepatology (1981) 1;127-131). Thus, the DNA produced during 
infection wilt have a "genomic" strand and a complementary strand. 

The epidemiology and mode of transmission appears to be similar for HDV to that of hepatitis B (HBV), 
in that it is transmitted through blood transfusion and by close direct contact of body fluids. Three patterns 

75 cf HDV (or 5) infection have been identified: acute 5 infection superimposed on chronic B, chronic 5 
superimposed on chronic B, and simultaneous acute 5 and hepatitis B infections (Schiff. E.R., et al t 
Diagnostic Medicine (March 1985) 17-22). While the disease was originally identified in the Mediterranean 
basin, it appears to be spreading worldwide (Jacobson, I.M., et al, Hepatology (1985) 5:188-191). A review 
of the demographic and epidemiological aspects of this disease is also found in Rizzetto. M. et al, J Hepatol 

20 (1985) 1_:1 87-1 93. 

Although the course of the disease has been well characterized and the general structure of the virion is 
understood, no information has previously been available as to the genetic structure of the virus, nor has the 
nature of the 5 antigen been characterized. The only available assay to detect the presence of the disease 
by using blood samples is an immunoassay marketed in Europe, which has not yet received FDA approval 

25 in the United States. Previous detection methods were limited to direct immunofluorescence in the nuclei of 
hepatocytes in biopsy specimens. One form of the assay is based on the ability of antibody in test serum to 
block binding of labeled IgG anti-5 to 5 antigen per se. Another configuration relies on the ability of IgM anti- 
5 from the test serum to bind antihuman IgM (specific for a chain) fixed to the solid phase, followed by the 
addition of standard 5 antigen and labeled IgM anti-5 so that the presence of IgM anti-5 in the test serum 

30 (along with the added 5 antigen) permits binding of labeled anti-5 IgM. Neither of these tests requires 
analysis of, or knowledge of, the 5 antigen protein structure of HDV genomic structure. 

It is now possible to design efficient probes for diagnosis of the disease by DNA hybridization, as well 
as to generate recombinant proteins usable as vaccines and as reagents in diagnostic testing. In addition, 
the recombinantly produced proteins can be used to generate antibodies useful for diagnosis or for passive 

35 therapy. 

The invention provides a family of cDNA replicas of an entire HDV genomic sequence. Portions of these 
cDNA sequences are useful as probes to diagnose the presence of virus in clinical samples, and to isolate 
naturally occurring variants of the virus. An understanding of the basic genomic sequence (and its 
complement) also makes available the polypeptide sequence of the 5 antigens encoded within one of the 

40 open reading frames and permits production of these peptides or portions thereof which are. useful as 
standards or reagents in diagnostic tests and as components of vaccines. Similarly, analysis of other open 
reading frames in either strand permits deduction of additional viral peptide sequences which are 
characteristic of HDV and may be similarly useful. Protective antibodies may also be raised from the 
recombinantly produced proteins and may be obtained in polyclonal or monoclonal form. 

45 The availability of an entire HDV sequence thus permits the design and construction of polypeptides 
which may either serve as vaccines or diagnostic reagents, or as intermediates in the production of 
monoclonal antibody (Mab) preparations useful in passive immunotherapy against the disease, or as 
intermediates in the production of antibodies useful as diagnostic reagents. Without the sequence of the 
entire genome at the disposal of the designer of therapeutic or preventive compositions, successful 

so production of optimally effective products would be impossible. 

Accordingly, in one aspect, the invention relates to nucleotide sequences useful for the production of 
HDV diagnostics and vaccines, derived from the HDV genome or its complement as represented in Figure 
2. The invention thus relates to utilizing this sequence or portions th reof as oligomeric probes, for 
production of peptides which can serve as diagnostic reagents or as vaccines, to these p ptides 
themselves, and to polyclonal and monoclonal antibodies useful in diagnosis and treatment of the dis ase. 
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Other aspects of the invention include expression systems which are capable of effecting the production 
of a desired protein encoded by sequences derived from the complete genome, to recombinant vectors 
containing such systems or portions thereof, to recombinant host cells transformed with such vectors, to 
proteins produced by the transformed cells, and to vaccines prepared from such proteins. In addition, the 

5 invention relates to specific peptide sequences representing epitopes encoded by the genome, and to such 
sequences covalently linked to label or to carrier proteins. Carrier proteins, in addition to more conventional 
carriers, include the 22nm particle associated with hepatitis B infection, which carries polyalbumin receptor 
sites, and is 1000-fold more immunogenic than the unassembled subunit component. By inserting antigenic 
HDV determinants into the 22nm HBsAG particle, increased immunogenicity for these epitopes is obtained. 

to The invention also relates to the methods of preparing these desired polypeptide vaccines and 
immunoglobulins, and to kits for assay containing the probes, polypeptides, and/or immunoglobulins. 

The invention further includes the use of a nucleotide sequence derived from the HDV genome or its 
complement as represented in Figure 2, said sequence optionally being as further defined comprising a 
region derived from open reading frames (ORFs) 1-12, and preferably is derived from ORF1, or ORF2, or 

15 ORFS, or ORF6, or 0RF7, and more perferably is derived from ORFS, in the preparation of a vaccine 
against HDV or in the preparation of a kit for the detection of material characteristic of HDV. 
In the accompanying drawings:- 

Rgure 1 shows a diagram of the HDV single-stranded RNA genome and the position of overlapping 
cDNA clones used to determine its structure. 

20 Figure 2 shows the complete nucleotide sequence of the double-stranded cDNA corresponding to the 

entire HDV RNA genome. 

Figure 3 shows the sequence of cDNA equivalent to the RNA of ORFS. The deduced amino acid 
sequence and heterogeneities in nucleotides as determined from other clones are also shown. 

Figure 4 shows the sequence of clone 51 useful in obtaining the nucleotide sequence of the virus. 
25 Figure 5 shows the hybridization of probe to viral RNA. 

Figure 6 shows gels demonstrating the production by E. coli of immunologically reactive HDV 
peptides. 

Figure 7 shows the positions of the ORFs of the HDV genome and its complement 
Figure 8 shows an immunoblot using HDV antiserum of the expressed products of ORFs 1 , 2, 6, and 
30 7 fused to SOD, and of the unfused expression product of ORFS. 

Figure 9 is a restriction map of pAB24, including some genetic features. 

Figure 10A shows an immunoblot using HDV antiserum of the unfused ORFS produce expressed in 
E col[ compared to antigens present in HDV particles and in infected liver lysates. 

Figure 10B shows an immunoblot demonstrating the competition for HDV antibodies between ORFS 
35 product expressed in yeast with p24 5 and P27 5 present in HDV particles. 

Figure 10C shows an immunoblot demonstrating the competition for HDV antibodies between ORFS 
product expressed in yeast and bacteria with p24 5 and p27* present in HDV infected liver. 

Figure 1 1 shows liver slices stained by an indirect immunoperoxidase staining method demonstrating 
that ORFS product expressed in yeast competes with liver HDV 6 antigen for HDV antibodies. 

40 

A. Definitions 

As used herein, a nucleotide sequence "derived from" the HDV genome or cDNA refers to a sequence 
45 which retains the essential properties of the illustrated polynucleotide, representing a portion of the entire 
sequence from which it is derived, for the purpose intended. A specific, but nonlimiting, example of such 
derivation would be represented by a sequence which encodes an identical or substantially identical amino 
acid sequence, but because of codon degeneracy, utilizes different specific codons; another example is the 
complementary strand. A probe or oligonucleotide useful in diagnostic tests needs to retain the complemen- 
so tarity of the sequence shown but may be shorter than the entire sequence or may skip over portions of it. 
However, for use in manipulation or expression, nucleotide changes are often desirable to create or delete 
restriction sites, provide processing sites, or to alter the encoded amino acid sequence in ways which do 
not adversely affect functionality. * Nucleotide sequence" refers both to a ribonucleotide and a deox- 
yribonucleotide sequence and includes both the genomic strand and its complementary strand. 
55 A DNA "derived from" the nucleotide sequence which comprises the genome of HDV therefore refers 
to a DNA sequence which is comprised of a sequence corresponding to that of a region of the g nomic 
nucleotide sequ nee (or its complement), or a combination of regions of that sequence modified in ways 
known in the art to be consist nt with its int nded use. These DNAs are, of course, not necessarily 
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physically derived from the nucleotide sequence of the gene, but refer to polynucleotides generated in 
whatever manner which are based on the information provided by the sequence of bases in the region(s) 
from which the polynucleotide is derived. For example, regions from which typical DNA sequences can be 
"derived" include regions encoding specific epitopes and regions encoding portions of 6 antigen. Similarly, 
5 a peptide "derived from" the 5 antigens refers to an amino acid sequence substantially identical to that of 
these polypeptides or a portion thereof, having the same biological properties as that portion. The manner 
of synthesis of such a "derived" peptide is not material-it may be chemical synthesis or recombinant 
means, for example. 

"Recombinant host cells", "host cells", "cells", "cell lines", "cell cultures", and other such terms 
70 denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities, are used inter- 
changeably, and refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transfer DNA, and include the progeny of the original cell transfected. It is understood that the 
progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic 
or total DNA complement as the original parent due to accidental or deliberate mutation. Progeny of the 
75 parental cell which are sufficiently similar to the parent to be characterized by the relevant property, such as 
the presence of a nucleotide sequence encoding a desired peptide, are included in the progeny intended 
by this definition, and are covered by the above terms. 

"Control sequence" refers to DNA sequences which are necessary to effect the expression of coding 
sequences to which they are ligated. The nature of such control sequences differs depending on the host 
20 organism; in prokaryotes, generally such control sequences include promoter and ribosome binding site; in 
eukaryotes, generally, such control sequences include promoters, terminators, and, in some instances, 
enhancers. The term "control sequences" is intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also include additional components whose presence is 
advantageous. 

25 "Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 
permitting them to function in their intended manner A control sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 

An "opening reading frame" is a region of a polynucleotide sequence which encodes for a polypeptide. 

30 "Immunologically identifiable with/as" refers to the presence of epitopes in the non-native, i.e., 
artificially synthesized or recombinant protein, which is also present in HDV viral proteins. These epitopes 
may be identified by their immunological reactivity with antibodies directed against the HDV proteins. Their 
presence in the non-native protein may be detected by direct reactivity with the HDV antibodies, as well as 
by competition assays between the non-native proteins and HDV proteins for antibodies to HDV proteins. 

35 Methods of detecting antibody binding and of determining competition in binding are known to those of 
average skill in the art, and are also illustrated infra. 

B. General Description 

40 

The useful materials and processes of the present invention are made possible by the provision of a 
family of nucleotide sequences each containing an entire genome of hepatitis D virus. The availability of this 
family of polynucleotides, first, permits the isolation of other members of the genome family which differ by 
small heterogeneities. Second, ft permits the construction of DNA and proteins useful in diagnosis with 

45 respect to DNA, oligomers of about 8-10 bp or more useful as hybridization probes in disease diagnosis. 
Such probes may be used to detect the presence of the virai genome in, for example, sera of subjects 
suspected of harboring the virus. The HDV sequences also allow the design and production of HDV-specrfic 
polypeptides which are useful as diagnostic reagents for the presence of antibodies raised by HDV in 
serum or blood. Antibodies raised against these polypeptides are also useful as diagnostics. (Because open 

so reading frames in addition to that for S antigen can be deciphered in the context of the complete genome or 
its complement, the primary structures of HDV-related proteins, other than 5 antigen per se, can be 
deduced. These may also be marker polypeptides, characteristic of the virus, and useful in diagnosis and, 
possibly, in immunization.) Finally, knowledge of the gene sequences also enables the design and 
production of vaccines effective against HDV and also production of protective antibodies. 

55 Sequencing information available from the genome allows the amino acid sequence of the & antigen or 
other polypeptid s to be deduced and suitable pitopes identified. The entir 5 antigen or suitable portions 
thereof can be produced by fragments of the relevant DNA which are obtained and xpressed indepen- 
dently, thus providing desired polypeptides using recombinant techniques. Both prokaryotic and eukaryotic 
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hosts are useful for such expr ssion. Short polypeptide fragments may also be chemically synthesized and 
linked to carrier proteins for use as vaccines. In addition, the epitopes may be produced linked to a protein 
conferring immunogenicity. The proteins thus produced may themselves be used as vaccines, or may be 
used to induce immunocompetent B cells in hosts, which B cells can then be used to produce hybridomas 
5 that secrete antibodies useful in passive immunotherapy. 



8,1 • Preparation of the HDV Gene Sequence 

70 The serum of chimpanzees infected with HDV and containing a high titer of the virus (about 10 11 chimp 
infectious disease dose/ml) was used as the source of the virus. Nucleic acid extracted from the harvested 
virus, when analyzed by denaturing gel electrophoresis, consistently yielded a doublet RNA containing 
about 1700 nucleotides. Using this RNA as a template, an approximately 164 bp cDNA clone, pkD3, which 
specifically hybridizes to the RNA doublet, was obtained, and its DNA sequence determined (Denniston. 

75 KJ., et al. Science (1986) 232 :873-975). Based on this determined DNA sequence, provided in advance of 
publication, two complementary synthetic oligomers were prepared, only one of which hybridizes to the 
doublet RNA. 

The hybridizing oligomer was then used to probe a cDNA library that was prepared according to the 
Okayama/Berg method from the doublet RNA, resulting in clone 51, containing a 570 bp insert, which 
20 hybridized to the RNA doublet and was used as a probe to obtain overlapping clone 52 from the same 
library. 

Additional clones 54 and 51 15 were obtained by probing with the 51 clone a cDNA library prepared in 
pBR322 using random priming of the isolated RNA. 51 1 5 was then used as a probe to obtain overlapping 
clones 57a. 53b, and 57b. The independent clones 53b, 54, 57a, 57b, and 5115, along with 51 and 52 provided 
25 the complete sequence of the circular single-stranded 1679 nucleotide RNA diagrammed in Figure 1. 

The description of the method to retrieve the entire HDV genome is, of course, mostly of historical 
interest. The resultant sequence (and therefore, also, 0 its complement) is provided herein, and the entire 
sequence, or any portion thereof, could also be prepared using synthetic methods, or by a combination of 
synthetic methods with retrieval of partial sequences using methods similar to those here described. 

30 

B.2. Preparation of Viral Polypeptides and Their Fragments 

The availability of the entire genomic sequences permits construction of expression vectors encoding 

35 antigenically active regions of the 5 antigen, and any other viral polypeptide encoded by the genome or its 
complement Fragments encoding the desired proteins are obtained from the cDNA clones using conven- 
tional restriction digestion or by synthetic methods and are ligated into vectors, for example, containing 
portions of fusion sequences such as 0-galactosidase or superoxide dismutase (SOD), preferably SOD. Any 
desired portion of the HDV genome containing an open reading frame, in either sense strand, can be 

40 obtained as a recombinant protein, such as a mature or fusion protein, or can be provided by chemical 
synthesis or general recombinant means. 

The DNA encoding the desired polypeptide, whether in fused^or mature formV'and whether or not 
containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any 
convenient host Both eukaryotic and prokaryotic host systems are presently used in forming recombinant 

45 polypeptides, and a summary of some of the more common control systems and host cell lines is given in 
section C.1 herein below. The polypeptide is then purified from lysed cells or from the culture medium and 
purified to the extent needed for its intended use. Such peptides can be used as diagnostics or formulated 
into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics. 

Analysis of the genome shows the presence of a number of open reading frames (ORFs), at least one 

so of which, ORF5, encodes the 6 antigen. Others may encode previously unknown viral polypeptides. Several 
such frames containing a minimum of about 150 nucleotides preceded by an ATG start codon were 
identified. Additional reading frames are present with longer open sequences, but without ATG start codons. 
The reading frames were found both in the cDNA strand having the same sense as the genome, and in the 
antigenome strand. 

55 
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Five of the large ORFs encoding polypeptides containing a methionine proximal to the amino t rminus 
were expressed in bacteria. Only polypeptides encoded by the antigenomic ORFS cross-reacted with 
antisera obtained from patients with hepatitis 5 infections. Based upon immunological analyses using viral 
extracts and recombinant ORF polypeptides synthesized in bacteria and yeast, ORFS encodes the 

5 immunogenic epitopes shared by both hepatitis 5 viral polypeptides p27* and p24 fi , and probably 
represents the complete structural gene for p27* and p24 5 . Based upon immunocom petition studies 
described herein, the nuclear hepatitis 5 antigen is comprised of both p27 5 and p24 5 . 

A comparison of cDNA nucleotides sequences in' clones 115, 7a, 1, 4. 2, 7b, and 3b showed that there 
is a small degree of heterogeneity in the overlapping sequences (see Table 2). Nucleotide sequence 

io heterogeneity is not unusual in RNA containing viruses. Holland, J., et al (1982) Science 215 :1577. The 
sequence heterogeneities at portion 608 of ORFS in particular may be the basis for the distinction between 
p27^ and p24 5 , i.e., the size of the two polypeptides may result from tbe additional amino acids in the C- 
terminal portion of D27* If the position is occupied by G, the triplet containing it encodes tryptophan, and 
translation continues until the opai stop codon beginning at position 664 (see Figure 3). Alternatively, if 

75 position 608 contains A, the triplet containing it encodes an amber stop codon, and translation ceases at 
this point unless the last cell has the ability to suppress the amber codon, thereby allowing translation to 
continue to the opal codon. 

20 B.3. Preparation of Antigenic Polypeptides and Conjugation with Carrier 

The antigenic region of peptides is generally relatively small— typically 10 amino acids or less in length. 
Fragments of as few as 5 amino acids may typically characterize an antigenic region. These segments may 
correspond to regions of 5 antigen or to regions of additional encoded marker polypeptides. Accordingly, 
25 using the genome of HDV as a basis, DNAs encoding short segments of peptides can be expressed 
recombinantly either as fusion proteins or as isolated peptides. In addition, short amino acid sequences can 
be chemically synthesized conveniently. In instances wherein the synthesized peptide is correctly config- 
ured so as to provide the correct epitope, but too small to be immunogenic, the peptide may be linked to a 
suitable carrier. 

30 A number of techniques for obtaining such linkage are known in the art, including the formation of 
disulfide linkages using N-succinimidyl-3-(2-pyridyrthio)propionate (SPDP) and succinimidyl 4-(N-maleimido 
methyl) cyclohexane-1 -carboxylate (SMCC) obtained from Pierce Company, Rockford, Illinois. (If the 
peptide lacks a sulfhydryl, this can be provided by addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine residues on one protein and an amide linkage 

35 through the e-amino on a lysine, or other free amino group in the other. A variety of such disulfide/amide- 
forming agents are known. See, for example, Immun Rev (1982) 62:185. Other Afunctional coupling agents 
form a thioether rather than a disulfide linkage. Many of these thioether-forming agents are commerciaJly 
available and include reactive esters of 6-maJeimidocaproic acid, 2-bromoacetic acid, 2-iodoacetic acid, 4- 
(N-maleimido-methyl) cyclohexane-1 -carboxylic acid, and the like. The carboxyl groups can be activated by 

40 combining them with succinimide or 1 -hydroxy-2-nitro-4-sulfonic acid, sodium salt The foregoing fist is not 
meant to exhaustive, and modifications of the named compounds can clearly be used. 

Any carrier may be used, which does not itself induce the production of antibodies harmful to the host, 
such as the various serum albumins, tetanus toxoids, or keyhole limpet hemocyanin (KLH). 

The conjugates, when injected into suitable subjects, will result in the production of antisera which 

45 contain immunoglobulins specifically reactive against not only the conjugates, but also against fusion 
proteins carrying the analogous portions of the sequence, and against appropriate determinants within 
whole HDV. 



50 B.4. Preparation of Hybrid Particle Immunogens Containing HDV Epitopes 

The immunogenicity of the epitopes of HDV may also be enhanced by preparing them in mammalian or 
yeast systems fused with particle-forming proteins such as that associated with hepatitis B surface antigen. 
Constructs wherein the HDV epitope is linked directly to the particle-forming protein coding sequences 
55 produce hybrids which are immunogenic with respect to the HDV epitope. In addition, all of the vectors 
prepared include epitopes specific to hepatitis B virus (HBV). having various degrees of immunogenicity, 
such as, for example, th pre-S peptide. Thus, particles constructed from particle-forming protein which 
include HDV sequences are immunog nic with respect to both HDV and HBV. 
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Hepatitis surface antigen (HBsAG) has been shown to be formed and assembled in S. cerevisiae - 
(Valenzuela et al. Nature (1982) 298:344-350), as welt as in, for example, mammalian cells (Valenzuela, P.. 
et al, Hepatitis B (1984), Millman, L, et al, ed, Plenum Press, pp. 225-236). The formation of such particles 
has been shown to enhance the immunogenicity of the monomer subunrt The constructs may also include 

s the immunodominant epitope of HBsAg. comprising the 55 amino acids of the presurface (pre-S) region. 
(Neurath et al, Science (1984) 224:392-394.) Constructs of the pre-S-HBsAg particle expressible in yeast 
are disclosed in U.S. Serial No. 621,756, filed 18 June 1984; hybrids including heterologous viral sequences 
for yeast expression are disclosed in U.S. Serial No. 650,323. filed 13 September 1984. Both applications 
are assigned to the herein assignee and incorporated by reference. These constructs may also be 

w expressed in mammalian cells such as Chinese hamster ovary cells using an SV40-dihydrofolate reductase 
vector (Michelle et al, ]nt Symp on Viral Hepatitis (1984)). 

In addition, portions of the particle-forming protein coding sequence per se may be replaced with 
codons for an HDV epitope. In this replacement, regions which are not required to mediate the aggregation 
of units to form immunogenic particles in yeast or mammals can be deleted, thus eliminating additional 

75 hepatitis B antigenic sites from competition with the HDV epitope. 



B.5. Preparation of Vaccines 

20 Preparation of vaccines which contain peptide sequences as active ingredients is also well understood 
in the art. Typically, such vaccines are prepared as injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The 
preparation may also be emulsified or the protein encapsulated in liposomes. The active immunogenic 
ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the 

25 active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like 
and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the 
effectiveness of the vaccine. The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other 

30 modes of administration include suppositories and, in some cases, oraJ formulations. For suppositories, 
traditional binders and carriers may include, for example, polyalkaline glycols or triglycerides; such 
suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, 
preferably l%-2%. Oral formulations include such normally employed excipients as, for example, phar- 
maceutical grades of mannitol, lactose, starch magnesium stearate, sodium saccharine cellulose, magne- 

35 sium carbonate and the like. These compositions take the form of solutions, suspensions, tablets, pills, 
capsules, sustained release formulations or powders and contain 10%-95% of active ingredient, preferably 
25%-70%. 

The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable 
salts, include the acid addition salts (formed with the free amino groups of the peptide) and which are 
40 formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids 
as acetic, oxalic, tartaric, mandelic. and the like. Salts formed with the free carboxyl groups may also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethy (amine, 2-ethylamino ethanol, histidine, 
procaine, and the like. 

45 The vaccines are administered in a manner compatible with the dosage formulation, and in such 
amount as will be therapeutically effective and immunogenic. The quantity to be admtnistered depends on 
the subject to be treated, capacity of the subject's immune system to synthesize antibodies, and the degree 
of protection desired. Precise amounts of active ingredient required to be administered depend on the 
judgment of the practitioner and are peculiar to each subject. It should be noted that since 5 infection is 

so dependent on infection with hepatitis B, a subpopulation for which an anti-5 vaccine is particularly useful is 
the pool of hepatitis B carriers. It may also be beneficial to construct "dual" vaccines containing both B and 
D antigens. 

The polypeptides encoded within ORF5 (and peptides derived therefrom) are particularly suitable 
vaccine components for protection against HDV infection, despite the fact that ORF5 encodes core antigens 
55 of the HDV particle. Vaccines containing recombinantly produced core antigens of HBV are effective in 
protecting against or alleviating hepatitis B infection. Murray, K., et al, EMBO J (1984) 3:645. 
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B.6. Preparation of Antibodies Against HDV Epitopes 

The immunogenic proteins prepared as described above are used to immunize mammals. The resulting 
antisera are useful as diagnostic reagents. Also lymphocytes or splenocytes from these animals may be 
5 used to prepare hybridomas capable of secreting monoclonal antibodies directed against these epitopes 
and cross-reactive against the infective virus. The resulting monoclonal antibodies are particularly useful in 
diagnosis, and those which are neutralizing are useful in passive immunotherapy. 

The polypeptides encoded within ORF5, and antibodies to these polypeptides are particularly useful for 
immunodlagnosis of HDV. As discussed below, ORF5 encodes the 5 antigen, which apparently is comprised 
10 of two viral polypeptides, p24* and P27 5 . 



B.7. Diagnostic Oligonucleotide Probes and Kits 

is Using the disclosed family of HDV genomes as a basis, oligomers of approximately 8 bp or more can 
be prepared, either by excision or synthetically, which hybridize with the HDV genome and are useful in 
detection of the virus in diseased individuals. While 8 bp is a workable length, sequences of 10-12 bp are 
preferred, and about 20 bp appears optimal. Preferably these sequences will derive from regions which lack 
the heterogeneity. These probes can be prepared using routine methods, including automated 

20 oligonucleotide synthetic methods. Among useful probes, for example, are the clone 51, the various 
oligomers useful in probing cDNA libraries set forth below, and the additional clones disclosed herein. 
Particularly useful are those clones containing fragments of ORF5. Any portion of the genome or its 
complement will be satisfactory. For use as probes, complete complementarity is desirable, though it may 
be unnecessary as the length of the fragment is increased. 

25 For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, is 
treated, if desired, to extract the nucleic acids contained therein, and the resulting nucleic acid subjected to 
gel electrophoresis or other size separation technique or simply dot blotted without size separation. The 
probes are then labeled, using, for example, nick translation or kinasing, and the extracted nucleic acids 
then treated with labeled probe under suitable hybridization stringencies. 

30 Since the probes can be made completely complementary to the viral RNA, high stringency conditions 
are desirable in order to prevent false positives. However, high stringency conditions should only be used if 
the probes are complementary to regions of the viral genome which lack heterogeneity. The stringency of 
hybridization is determined by a number of factors, including temperature, ionic strength, length of time 
permitted for hybridization and for washing, and concentration of formamide. These factors are outlined, for 

35 example, in Maniatis, T., et al, Molecular Cloning: A Laboratory Manual (1982), Cold Spring Harbor Press, 
Cold Spring Harbor, New York. Increased stringency can be achieved, for example, by raising the 
temperature, shortening the time of exposure, and adjusting the ionic strength. 

The probes can be packaged into diagnostic kits which include the labeled DNA, suitably packaged, 
additional reagents, and materials needed for the particular protocol, and instructions for conducting the 

40 test 



B.8. Immunoassay Diagnostic Kits 

45 Both the polypeptides which react immunologically with serum containing HDV antibodies, e.g.. the 
ORF5-encoded polypeptides, and the antibodies raised against these polypeptides are useful as compo- 
nents of diagnostic kits designed to detect the presence of HDV antibodies in blood or serum samples or to 
detect the presence of the virus, as the case may be. Design of the immunoassays is subject to a great 
deal of variation, and several protocols based on competition or direct reaction on solid supports or on 

so immunoprecipitation, for example, are available. Most assays involve the use of labeled antibody or 
polypeptide containing fluorescent, radioactive or dye molecules as tags. Enzyme-labeled and mediated 
immunoassays are also commonly used. Therefore, kits suitable for use in such protocols and containing 
the appropriate labeled reagents are constructed by packaging the appropriate materials, including the 
antibodies or polypeptides of the invention in suitable containers along with the remaining requirements for 

55 conduct of the assay and a suitable set of instructions for conducting it 
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C. General Methods 

The general techniques used in extracting RNA from the virus, preparing and probing a cDNA library, 
sequencing clones, constructing expression vectors, transforming cells, and the like are known in the art 
5 and laboratory manuals are available describing these techniques. However, as a general guide, the 
following sets forth some sources currently available for such procedures, and for materials useful in 
carrying them out. 

70 C.1. Hosts and Expression Control Sequences 

Both prokaryotic and eukaryotic host cells may be used for expression of desired coding sequences 
when appropriate control sequences used are compatible with the designated host. Among prokaryotic 
hosts, E. coli is most frequently used, mostly for convenience. Expression control sequences for pro- 

75 karyotes include promoters, optionally containing operator portions, and ribosome binding sites. Transfer 
vectors compatible with prokaryotic hosts are commonly derived from, for example, pBR322 a plasm id 
containing operons conferring ampicillin and tetracycline resistance, and the various pUC vectors, which 
also contain sequences conferring antibiotic resistance. The foregoing operons may be used as markers to 
obtain successful transformants by selection. Commonly used prokaryotic control sequences include the j8 

20 lactamase (penicillinase) and lactose promoter systems (Chang, et al, Nature (1977) 198:1056), the 
tryptophan (trp) promoter system (Goeddei. et al, Nucleic Acids Res (1980) 8:4057) and the X derived P L 
promoter and N gene ribosome binding site (Shimatake et al, Nature (1981) 292:128) and the hybrid tac 
promoter (De Boer et al, Proc Natl Acad Sci (USA) (1983) 80:21-25) derived from sequences of the trrj and 
the lac UV5 promoters. The foregoing systems are particularly compatible wfth E coli; if desired other 

25 prokaryotic hosts such as strains of Bacillus or Pseudomonas may be used, with corresponding control 
sequences. 

Eukaryotic hosts include yeast and mammalian cell culture. Saccharomyces cerevisiae , or Baker's 
yeast and Saccharomyces carlsbergensis are the most commonly used yeast hosts, again because of 
convenience. Yeast compatible vectors carry markers which permit selection of successful transformants by 

ao conferring prototrophy to auxotrophic mutants or by conferring antibiotic resistance or resistance to heavy 
metals or wild-type strains. Yeast compatible vectors may employ the 2 micron origin of replication (Broach. 
J., et al, Meth Enz (1983) 101 :307) the combination of CEN3 and ARS1 or other means for assuring 
replication, such as sequences which will result in incorporation of an appropriate fragment into the host cell 
genome. Control sequences for yeast vectors include promoters for the synthesis for glycolytic enzymes 

35 (Hess et al, J Adv Enzyme Reg (1968) 7:149, Holland et al, Biochemistry (1978) 17:4900), and the promoter 
for 3 phospTioglycerate kinase (Hitzeman et al, J Biol Chem (1980) 255:2073). For yeast expression, 
terminators may also be included, such as those derived from the enoiase gene (Holland, M. J., J Biol 
Chem (1981) 256:1385). Particularly useful control systems include those specifically described herein, 
which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol de- 

40 hydrogenase (ADH) regOlatable promoter, terminators aiso derived from GAPDH, and, if secretion is 
desired, leader sequence from yeast alpha factor. These systems are described in detail in U.S. Serial Nos. 
468.589 and 522,909, filed 22 February 1983 and 12 August 1983, respectively, both assigned to the herein 
assignee, and incorporated herein by reference. 

Mammalian cell lines available as hosts for expression include many immortalized cell lines available 

45 from the American Type Culture Collection, including HeLa cells, Chinese hamster ovary (CHO) cells, baby 
hamster kidney (BHK) cells, and a number of other cell lines. Suitable promoters for mammalian cells 
prominently include viral promoters such as that from Simian virus 40 (SV40) (Fiers, et al, Nature (1978) 
273:113) or other viral promoters such as the Rous sarcoma virus (RSV) adenovirus, and bovine papilloma 
virus (BPV). Mammalian cells may also require terminator sequences. Vectors suitable for replication in 

so mammalian cells may include viral replicons, or sequences which insure integration of the appropriate 
sequences into the host genome. 



55 
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C.2. Transformations 

The transformation procedure used depends on the host to be transformed. Bacterial transformation 
generally employs treatment with calcium or rubidium chloride (Cohen, S. N., Proc Natl Acad Sci (USA) 
5 (1972) 69:2110, Maniatis et al, Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor Press, 
page 254). Yeast transformations may be carried out using the method of Hinnen et al. Proc Natl Acad Sci - 
(1978) 75:1929-1933. Mammalian transformations are conducted using the calcium phosphate precipitation 
method of Graham and van der Eb, Virology (1 978) 52:546, or the various modifications thereof. 

70 

C.3. Vector Construction 

Vector construction employs techniques which are by now quite well understood. Site-specific DNA 
cleavage is performed by treating with suitable restriction enzymes under conditions which generally are 

rs specified by the manufacturer of these commercially available enzymes (see, e.g., The New England 
Biolabs Product Catalog). In general, about 1 ug of plasmid or DNA sequence is cleaved by 1 unit enzyme 
in about 20 u\ buffer solution for an incubation time of about 1-2 hr at about 37°C. After incubation with the 
restriction enzyme, protein is removed by phenol/chloroform extraction and the DNA recovered by 
reprecipitation with ethanol. The cleaved fragments may be separated using polyacrylamide or agarose gel 

20 electrophoresis techniques, according to the general procedures found in Methods in Enzymology (1980) 
65:499-560. 

Sticky ended cleavage fragments may be blunt ended using E. coH DNA polymerase I (Klenow) in the 
presence of the appropriate deoxy nucleotide triphosphates (dNTPs) using incubation conditions appropriate 
to the polymerase. The polymerase digests protruding 3 single strands, but fills in 5' protruding ends, 
25 according to the dNTPs present in the mixture. Treatment with S1 nuclease may also be used, as this 
results in hydrolysis of any single stranded DNA portion. 

Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase, and 
ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments 
are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
ao phosphatase (BAP) in order to remove the 5' phosphate and thus prevent religation of the vector; 
alternatively, restriction enzyme digestion of unwanted fragments can be used to prevent religation. 

Ligation mixtures are transformed into suitable cloning hosts, such as E. coli , and successful transfor- 
mants selected by, for example, antibiotic resistance, and screened for the correct construction. 

35 

C.4. Construction of Desired DNA Sequences 

Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as de- 
scribed by Warner, B.D., et al, DNA (1984)3:401-411. If desired, these synthetic strands may be kinased for 

40 labelling with by using an excess of polynucleotide kinase in the presence of labeled ATP, under 
standard kinasing conditions. 

DNA sequences including those isolated from genomic or cDNA libraries may be modified by site 
directed mutagenesis, as described by Zoller, M., et al, Nucleic Acids Res (1982) 10:6499. Briefly, the DNA 
to be modified is packaged into phage as a single stranded sequence, and converted to a double stranded 

45 DNA with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the portion of 
the DNA to be modified, and having the desired modification included in its own sequence. The resulting 
double stranded DNA is transformed into a phage supporting host bacterium, and cultures of the 
transformed bacteria, which will contain replications of each strand of the phage, are plated in agar to obtain 
plaques. Theoretically 50% of the new plaques will contain phage having as a single strand the mutated 

so form; 50% will have the original sequence. Replicates of the plaques are hybridized to kinased synthetic 
probe at temperatures and conditions which permit hybridization with the correct strand, but not with the 
unmodified sequence. The thus identified, desired, modified sequences are then recovered and cloned to 
serve as sources for the desired DNA. 

55 
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C-5. Hybridization with Probe 

DNA libraries are probed using the procedure of Grunstein and Hogness ( Proc Natl Acad Sci (USA) 
(1975) 73:3961). Briefly, in this procedure, the DNA to be probed is immobilized on nitrocellulose fitters, 

s denatured, and prehybridized with a buffer containing 0-50% formamide, 0.6 M NaCI, 60 mM sodium 
citrate, 0.02% (wt/v) each of bovine serum albumin, polyvinyl pyrollidine, and Ficoll, 50 mM sodium 
phosphate (pH 6.5), 1% glycine, and 100 u.g/ml carrier denatured DNA. The percentage of formamide in the 
buffer, as well as the time and temperature conditions of the prehybridization and subsequent hybridization 
steps depends on the stringency desired. Oligomeric probes which require lower stringency conditions are 

w generally used with low percentages of formamide, lower temperatures, and longer hybridization times. 
Probes containing more than 30 or 40 nucleotides such as those derived from cDNA or genomic sequences 
generally employ higher temperatures, e.g. about 40-42° and a high percentage, e.g. 50% formamide. 
Following prehybridization this same buffer, now containing the ^P kinased oligonucleotide probe, is added 
to obtain hybridization. Radioautography of the treated filters shows the location of the hybridized probe, 

75 and the corresponding locations on replica filters which have not been probed can then be used -as the 
source of the desired DNA. 



C. 6. Verification of Construction and Sequencing 

20 

For routine vector constructions, ligation mixtures are transformed into E. coli strain HB101 or other 
suitable host and successful transform ants selected by antibiotic resistance or other markers. Plasmids 
from the transform ants are then prepared according to the method of Clewell, D. B„ et at Proc Natl Acad 
Sci (USA) (1969) 62:1159. usually following chloramphenicol amplification (Clewell, D. B., J Bacteriol (1972) 
25 110:667). The isolated DNA is isolated and analyzed by restriction analysis, or sequenced by the dideoxy 
method of Sanger, F., et al. Proc Natl Acad Sci(USA) (1977) 74 :5463, as further described by Messing, et 
aI - Nucleic Acids Res (1981) 9:309, or by the method of Maxam et al, Methods in Enzymology (1980) 
65:499. To overcome problems with band compression, which are sometimes observed in GC rich regions, 
T-deazoguanosine was used. Barr, P., et al Biotechniques (1 986) 4:428. 

30 

D. Examples 

The following examples are intended to illustrate but not to limit the invention. The procedures set forth, 
35 for example, in flD.1 may, if desired, be repeated but need not be, as techniques are available for 
construction of the desired nucleotide sequences based on the information provided by the invention. 
Expression is- exemplified in E. colt and yeast however, other systems are available as set forth more fully 
in UC.1. Additional epitopes derived from the genomic structure may also be produced, and used to 
generate antibodies as set forth below. 



D.1 • Preparation of HDV cDNA -u^.^ 

Chimpanzee serum containing approximately 10 11 chimp infectious doses/ml of 5 agent was ultracen- 
45 trifuged and the nucleic acid was extracted from the resulting pellet after incubation with proteinase K. 
Briefly, the RNA was extracted from the virions by conventional procedures, for example, that disclosed by 
Ticehurst, J. E., et al, Proc Acad Sci (USA) (1983) 80:5885-5889, including protease treatment and 
phenol/chloroform extraction, following by ethanol precipitation. HDV was centrifuged through 20% sucrose 
in 20 mM HEPES pH 7.5 and 0.1% BSA. After proteolytic digestion with 1 mg/ml proteinase k, 50 ug/ml 
50 yeast transfer RNA, 20mM HEPES pH 7.5, 50 mM EDTA, 20 mM NaCI and 1% SDS overnight at 37°C t 
RNA was purified by phenol/CHCb extraction and precipitation with ethanol. 

The nucleic acid was analyzed using denaturing gel electrophoresis to obtain a 1700 nucleotide RNA 
doublet as determined by hybridization analysis. The doublet was used by Denniston, K.J., et at Science - 
(1986) (supra), to obtain an approximately 164 bp cDNA clone, pkD3, which specifically hybridizes to the 
55 doublet, as well as to samples infected with 5 agent. 



11 



0 251 575 



Two complementary oligonucleotides were synthesized using the sequence information obtained from 
the Denniston et al pkD3 cDNA clone as a basis. Probe 1: 5'-GATGCCCTTCCCGATGCTCGATTCCGACTC 
and Probe 2: 5-GAGTCGGAATCGAGCATCGGGAAGGGCATC were labeled by kinasing using 200 uCi 32. 
[P] ATP, >5 Ci/umoL Probes were kinased at the 5' terminus with T4 kinase according to the method of 

s Lillehaug, et al ( Biochemistry (1976) 15:1858 followed by purification on a Sep-pak C18 cartridge (Millipore) 
using elution with 50% v/v CH3OH, 50 mM ammonium acetate, pH 7.5. 

For hybridization to DNA probes, HDV RNA was electrophoresed through a 1% agarose-form aldehyde 
gel along with control chimpanzee RNA and DNA size markers (Lehrach, H., et al, Biochemistry (1977) 
16:4743). Each gel was blotted onto a nitrocellulose membrane and hybridized to labeled specific probe as 

10 described by Thomas, Proc Natl Acad Sci (USA) (1980) 77:5201. Treatment of gels containing the template 
RNA and suitable controls with each of these probes showed that only Probe 2 hybridized to the template, 
confirming the single stranded nature of the genome. 

A cDNA library was prepared from the original RNA extract of the chimpanzee serum pellet by the 
method of Okayama and Berg (Mo[ Cell Biol (1982) 2:161-170), after attaching poly(rA) tails to the 3'- 

75 hydroxy terminus of the RNA. The RNA showed extensive degradation during the incubation with the pofy- 
(rA) polymerase. However, probing the resulting cDNA library with Probe 2 resulted in the retrieval of a 
clone, 51, which has the sequence shown in Figure 4. A smaller (250 bp) overlapping- clone, 52, was also 
found in the cloned cDNA of 51 . 

Strand-specific probes were prepared from 51 using a -950 bp Pvu l I /Hin d Ml restriction fragment 

20 (containing flanking regions) or a -450 bp Pvull/Pstl fragments, in order to identify the genomic and 
complementary strands oT the cDNA. These fragments were ligated into M13 vectors to generate com- 
plementing single-stranded 5 templates. To prepare hybridization probes, 0.8 ug of each template DNA was 
mixed with 0.1 ug of hybridization probe primer (New England Biolabs) in 200 M.M NaCI, followed by 
incubation for 15 minutes at 37°C after denaturing in a boiling water bath for 1 minute. The annealed 

25 mixture was incubated for 2 hours at 15°C and 200 ul containing 50 mM Tris-CI, pH 7.5, 5 mM MgCk 10 
mM 0-mercaptoethanol, 50 ug/ml BSA, 0,1 mM dATP, dGTP, and dTTP. 14 uM dCTP (1000 Ci/mmol), 
along with 250 U/ml Klenow to label the single-stranded inserts. The reaction was stopped and the DNA 
purified on G50 Sephadex and the resulting probe eluting in void volume, was used to hybridize to a 
Northern blot containing the labeled template RNA. 

30 The results for a successful probe (one of the -450 bp Pvu ll/Pstl fragment strands) are shown in Figure 
5. Lane 1 contains labeled markers, lane 2 contains 10 ng 5 virion RNA from plasma, lanes 3 and 4 contain 
1 .4 ug of liver RNA from control and infected chimpanzees, respectively. Lanes 2 and 4 clearly show the 
presence of viral nucleic acid. 

An additional HDV cDNA library was prepared by using calf thymus random primers (Taylor, J.M., et al, 

35 Biochem Biophys Acta (1976) 442 :324-3300) to prime reverse transcription of HDV RNA. The resulting 
single-stranded cDNA was then purified and rendered double-stranded by incubation with E. coji DNA 
polymerase L Following treatment with SI nuclease, the cDNA was tailed with oligo-dC using terminal 
transferase and annealed with dG-tailed pBR322 that had been previously restricted with Pstl. The plasmids 
were then transformed into the host bacterium B coli MC1 051 , and tetracycline-resistant recombinants were 

40 colony-hybridized as described below to screen for 5 clones. (These general methods are described in 
Maniatis, T. f et al, in Molecular Cloning (Cold Spring Harbor Laboratory) pp. 229-242 (1982).) 

The 435 bp Ncol fragment from the cDNA insert of 51 was nick-translated-and used to screen the above 
random-primed cDNA library to obtain 54 and 5115. A 481 bp Hindlll/Smal fragment of the cDNA Insert in 
5115 was used to screen this library to obtain 57a. Clones 53b and 57b were obtained using an 

45 oligonucleotide probe based on a sequence from 51 15 (5-TGGAACGTCGGAGAAAC-30- 

Thus additional clones were retrieved from this library, as follows: 53b (829 bp), 54 (1123 bp), 57a (474 
bp), 57b (1378 bp), and 5115 (1362 bp). When these clones, and 51 and 62, were sequenced, overlapping 
portions of the genome, as illustrated in Figure 1 , were obtained. The sequencing data strongly suggested 
that the original HDV RNA was a circular molecule since the sequences of the 7 different cDNA clones 

50 could not be fitted into a linear molecule of only -1700 nucleotides in length. This hypothesis was 
confirmed by visualizing circular HDV RNA molecules in the electron microscope under denaturing 
conditions. The complete sequence of DNA representing the genome and its complement is shown in 
Figure 2, taking account of the overlapping portions of the various clones. The upper strand represents the 
HDV genomic RNA, the lower its complement There was some sequence heterogeneity between the 

55 various clones, as indicated in Figure 2. 
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The heterogeneities in nucleotide sequence are indicated above the genomic strand. The effect on the 
amino acid encoded is indicated below the complementary strand; AM indicates an amber stop codon. and 
OP indicates an opal stop codon. Table 1 presents a comparison of the heterogeneities in several of the 
clones. 

Table 2 shows putative polypeptides encoded by open reading frames (ORFs) of at least 300 
nucleotides. The position of the first nucleotide in each open 
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reading frame is indicated according to the numbering of the upper strands shown in Figure 2. The upper 
strand, representing the genomic sequence is numbered 1-1679. Positions in the complement have the 
same numbers, but are preceded by x. Polypeptides encoded by regions of the complement thus are given 
with numbers in "reverse" order— e.g., x1619-x1014 for 0RF5. The first nucleotide number in the table is 
that of the first nucleotide in the frame^-not the ATG. The translation^ reading frame or ORF5 is shown in 
Rgure 2 (putative polypeptide p1 of Table 2) and a potential N-glycosylation site is indicated by \ 
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Nucleotide sequence analysis of clones containing the ORF5 region revealed several sequ nc 
heterogeneities in this region. These heterogeneities are indicated in Figure 3, which shows the nucleotide 
sequence of ORFS. The heterogeneities in nucleotide sequence detected from other clones ar listed above 
the nucleotide sequence. The amino acid substitutions resulting from the sequence heterogeneity is listed 

5 above the deduced amino acid sequence. As a result of this heterogeneity in sequence, ORF5 encodes a 
family of closely related polypeptides. 

The heterogeneity at nucleotide position 608 of ORF5 (see Figure 3) is of particular interest since, as 
discussed below, both viral polypeptides of p24 6 and p27* appear to encoded in ORF5. If position 608 
contains an A, the resulting codon is an amber stop codon which would translate (unless the host contains 

to an amber suppressor system) to yield a polypeptide the size of p24 5 . However, if position 608 contains a G 
(of if the host has the ability to suppress the amber mutation), read through of the codon to the opal stop 
signal at position 664 yields a polypeptide the size of p27*. This suggestion is supported by the finding that 
expression of ORF5 in E coli D1210 transformed with porf5 yielded two products which are identifiable with 
the viral antigens p24 5 and p27 5 in terms of size and immunoreactivity (see §D.3). E. coli D1210 contains a 

75 leaky amber suppressor system; thus, a portion of translation terminates at the amber codon. Verification of 
the suggestion can be obtained by substituting G for A at position 608 of the ORF present in porfS. This 
substitution can be accomplished using in vitro site-directed mutagenesis, the techniques of which are 
known to those of average skill in the art. 

The complete genome of HDV represents a 1679 nucleotide circular sequence. It is presumed that the 

20 genomic RNA is single-stranded, as only one of the complementary synthetic oligomers and single- 
stranded 51 M13 probes hybridizes to the template. In addition, the template RNA cannot be translated in 
an in vitro rabbit reticulocyte lysate leading to the possibility that the genome is, in fact, representative of an 
anti-sense strand. 

25 

D-2. Confirmation of Polypeptide Encoding Clones 

The viral RNA derived from infectious plasma was random primed, and the resulting cDNA was cloned 
into the Pstl site of pBR322 using GC tailing as described above. The ligation mixtures were transformed 

30 into E. coli MC1061 and plasmid DNA prepared from a pool of about 20,000 recombinants. The plasmid 
DNA was cleaved with Pstl and the cDNA inserts were eluted from an agarose gel, blunted with Klenow, 
ligated to EcoRI linkers, and then cloned into the phage vector Xgtll (Young, et al, Proc Natl Acad Sci USA 
(1983) 80:1194-1198) at the unique EcoRI site using Y1090(r) as host. This phage-random cDNA library 
was then screened using hybridization to two probes derived from the above-referenced 54 and 5115 

35 clones. In addition, colonies were immunoscreened using antisera derived from humans that were chron- 
ically coinfected with hepatitis B and 5 viruses. 

Several plaques were obtained which bound both the probes and also the antisera One recovered 
plaque was sequenced and contained a cDNA of about 200 bp whose translations reading frame 
corresponded to part of polypeptide p1 translated from the antigenomic strand shown in Table 2. The j3- 

40 galactosidase fusion protein produced by this Xgtll thus contained at its carboxy terminus a region of 
polypeptide p1 that was responsible for the specific binding of 5 antiserum. Control antisera from previous 
infections with hepatitis A, B, and non-A/non-B did not bind to this fusion protein. Accordingly, p1 evidently 
contains an antigenic region capable of specific binding to 5-infected antisera and thus is useful in 
diagnosis. 

45 

D.3. Construction of Expression Vectors and Expression of HDV Sequences 

The HDV genome and the complement contain a number of ORFs (see §D.1). Several of these ORFs 
so have been expressed, and the antigenicity of the encoded polypeptides examined with respect to their 
ability to bind to HDV antiserum. Figure 7 is a diagramatic representation of HDV ORFs. All HDV ORFs 
greater than 300 nucleotides beginning with an ATG are aligned with the circular coordinates of the HDV 
genome. The thick lines represent the portion of each ORF expressed in bacteria The triangles ( ) denote 
the first in-frame ATG of each ORF. Arrows indicate translation of the genome or antigenomic strand, 
65 clockwise or counter-clockwise, respectively. Coordinates of each entire ORF, the region expressed in 
bacteria and the relative translational frame ar compiled in table form. 
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D.3.a. The Expression in E coli of Fusion Proteins Containing HDV Polypeptides Encoded in ORF5 and 
ORF6 

Bacterial expression plasmids were constructed which directed the synthesis of fusion proteins 
containing human superoxide dismutase (SOD) (Hallewell, et al, Nucleic Acids Res (1985) 1_3:2017) and also 
portions of HDV protein encoded within ORF5 or 0RF6, i.e., p1 and p2, respectively, The plasmids 
synthesized most of the ORF5-encoded p1 or ORF6-encoded p2 fused to the carboxy terminus of SOD. 

The expression plasmids were based on the tac promoter driven expression piasmid pSOD16 of 
Hallewell et al (supra). Piasmid pSOD16cf2 was generated from pSOD16 by replacement of a portion of the 
carboxy terminal coding region of the SOD gene and downstream polylinker sequences through the Mbol 
site by the new polylinker sequence 

5 * GATCGCCATGGGTACCCGGGTCGACTAAATGACTAG 3 ' 

3 1 CGGTACCCATGGGCCCAGCTGATTTACTGATCTTAA S 1 

The substitution of this polylinker sequence results in the removal of the natural carboxy terminal Gin of 
SOD. 

To insert the sequence derived from the HDV genome, the method of Steimer et al, J Virol (1986) 58:9- 
16. was followed, !>SOD16cf2 was suitably digested in order to accommodate the particular coding 
sequence desired as described below. 

For p1, the recovered DNA clone, 6115, was digested with Sst II, blunted with Klenow, and then 
digested with Sail to recover a 600 bp fragment isolated from an agarose gel. The isolated fragment was 
ligated into pSOD16cf2 which had been digested with Ncol. blunted, and then digested with Sail to yield 
pSOD-5p1. The fusion protein encoded containing 205 residues of the p1 amino acid sequence encoded by 
nucleotides x1 567 to x963. 

For the p2 protein, the recombinant DNA piasmid S4 was digested with Eco RI and Smal to recover a 
622 bp fragment which was ligated into EcoRI/SmakJigested pSOD16cf2 to yield pSOD-5p2. 

(Both of the resulting plasmids were sequenced to confirm the location and orientation of the p1 and p2 
encoding sequences at the C-terminus of the SOD protein.) 

The ligation products were transformed into E. coli D1 210 (Sadler et al. Gene (1980) 8:279-300). Single 
colony transformants were grown overnight at 37°C in 2 ml L-broth plus 100 ug/mi ampicillin. Glycerol 
(50%) stocks of these cultures were prepared and stored at -20 °C. 

For protein expression analysis, overnight cultures, in medium as above, were begun from glycerol 
stocks. These cultures were diluted 1/100 into the same medium and grown at 37°C to an OD©o of 0.6 
when aliquots were either rysed or induced for maximum expression by the addition of 1 mM iPTG and 
further incubation for 4 hours prior to lysis. 

Cells were lysed in the presence of SDS and DTT for analysis on denaturing polyacrylamide gels 
(Laemmli, Nature (1970) 277:680) and were immunoblotted according to Towbin et al, Proc Natl Acad Sci 
USA (1979) 76:4350. The results are shown in Figure 6. 

Immunoblots were reacted with 5 antiserum from chronically infected patients (panel A) or control 
antisera (panel B) containing antisera infected with non-5 hepatitis viruses. In addition, after prebinding with 
5% goat serum, the immunoblots were reacted with a 1:300 dilution of antisera diluted in 1 x PBS 
containing 0.3% Tween-20 and 5% goat serum, followed by incubation with 1:200 dilution of horseradish 
peroxidase-conjugated goat antihuman IgG and the blot was developed in the presence of the chromogen 
4-chloro-1-naphthol (Biorad). 

In Figure 6 lanes 1-4 contained extracts of cells containing the pSOD-5p1 recombinant vector; lanes 5 
and 6 contained extracts from cells transformed with the host vector; lanes 7-10 contained the correspond- 
ing pSOD-5p2 recombinant vectors. The samples of lanes 3, 4, 6, 9, and 10 were from cultures uninduced 
with IPTG; those from the remaining lanes, 1. 2, 5. 7, and 8 were from cultures further induced with IPTG. 
The presence of additional protein bands in lanes 1-4 as compared to lanes 5-10 shows the production of 
an antigenically reactive protein from pSOD-5p1, designated SOD-p1, but not from pSOD-5p2. Thus, ORF5 
but not ORF6 encodes protein which specifically bind human HDV antiserum. The failure to detect specific 
immunoreactive ORF6 fusion polypeptides was not due to a lack of expression in the bacterial host since, 
when monitored for binding to rabbit antiserum raised against human superoxide dismutase, the products 
expressed from pSOD-5p1 and pSOD-fip2 were present at similar levels. 
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As seen in Rgure 6a ( there are predominantly two translation products from pSOD^pl which are 
immunoreactive with HDV antiserum. The estimated size of the largest major immunoreactive ORF5 
polypeptide is 48.000 dattons, which is consistent with a fusion polypeptide containing 154 amino acids of 
superoxide dismutase and 205 amino acids specified by ORF5. This polypeptide may result from 

5 suppression of the amber codon in ORF5 (see Rgure 2). The amber codon is present in pSOD-5p1, and the 
host strain, E. coli D1210, is an amber suppressor strain. The second major polypeptide product, which is 
smaller, may have resumed from leakiness in the suppression, thus allowing termination at the amber codon. 
Other possible alternative explanations are that the smaller protein(s) may result from postradiational 
processing of a single product, or that there are alternate initiation sites within the ORF5 coding region. 

70 A sample of E coli strain D1210 (pSOD-5p1) has been deposited with the American Type Culture 
Collection (ATCC), 12310 Parklawn drive, Rockville, MD 20852, and has been assigned Accession No. 
67131. This deposit will be maintained under the conditions specified in the Budapest Treaty. 



75 D.3.b. The Expression in E coli of Fusion Proteins Containing HDV Polypeptides Encoded in QRFs 1^ 2, 
and 7. 

Bacterial expression vectors which directed the synthesis of fusion proteins containing portions of SOD 

and of HDV proteins encoded with ORFs 1, 2. and 7, i.e., the vectors pSOD-orfl, pSOD-orf2, and pSOD- 
20 orf7, were constructed. The construction conditions, and sequencing, were as described for pSOD-5p1 and 

pSOD-5p2 in § D.3.a.T'except for the following. 

For pSOD-orf 1 , the 436 b.p. insert fragment was isolated from clone 51 by digestion of the plasmid with 

Ncol, followed by gel purification. This fragment was ligated to Ncol treated, phosphatased, pSOD16cf2. 

The ORF1 fragment in the clone has the genomic orientation. 
25 For pSOD-orf2, the 593 b.p. insert gel purified fragment was isolated after digestion of clone 5115 with 

BstXI. followed by treatment with Klenow, and then digestion with EcoRI. This fragment was ligated to 

pSOD16cf2 which had been digested with Nco l, blunt ended with Klenow, and digested with EcoRI. 

For pSOD-orf7, a 439 bp insert gel purified fragment was isolated after digestion of clone 5115 with Alul 

and Sma l. This fragment was ligated to pSOD16cf2 which had been Smal digested and phosphatased. 
30 Proteins expressed in pSOD-orfl, pSOD-orf2, and pSOD-orf7 were analyzed by immunoblot as 

described for pSOD-5p1 and pSOD-5p2 (see § D.3.a.). 

The expression conditions were also as described in § D.3.a. The presence of ORF1. 2. and 7 hSOD 

fusion products in the bacterial lysates was demonstrated by partial reactivity with rabbit anti-hSOD 

polyclonal antibodies against hSOD. Lysates of bacterial cultures expressing each of the ORFs were 
35 immunoblotted onto nitrocellulose and incubated with individual antisera from 12 different patients with 

chronic HDV infections. The products expressed from pSOD-orf 1 . pSOD-orf2, and pSOD-orf7 did not bind 

to HDV antisera, although a product expressed from pORF5, the construction of which is described in § 

D.3.C., did bind the HDV antisera 

40 

D.3.C. The Expression in E. coli of Unfused HDV Polypeptides Encoded in ORF5 



A bacterial expression plasmid was constructed which directed the synthesis of unfused ORF5 encoded 
polypeptides. This vector, porfS, was similar to that used to express fused sod-orf polypeptides (pSOD-5pl , 

45 see § D.3.a.), except that it contained a second synthetic linker designed to terminate translation after the 
hSOD coding sequence and to reinitiate translation at the first ATG of the HDV sequence. This linker 
encodes 10 amino acids originally present in ORF5. including the amino terminal ATG. More specifically, 
the vector was constructed by ligating together the following: a) a 605 b.p. Sstll/Sall fragment which was 
restricted from 5115 and gel purified; b) the second linker; and c) the large vector fragment obtained by 

so treating pSOD-5p2 with Nco l and Sail. The linker sequence was: 

5 1 CATG GCT ACA GAG GAA TTA TAAT ATG AGC CGG TCC 
3' CGA TGT CTC CTT AAT ATTA TAC TCG GCC AGG 

55 

GAG TCG AGG AAG AAC CGC 3 ' 
CTC AGC TCC TTC TTG G S ' 
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Transformation of E coii D1210 with the plasmid porfS was as described in §D.3.a for transformation 
with other piasmids. The construction of the insert in porf5 was confirmed by DNA sequence analysis. This 
analysis also confirmed the presence of the amber codon in the 5115 derivative of ORF5 (see Figure 3 for 
the ORF5 sequence heterogeneities). 

s Expression of ORF5 polypeptides encoded within porf5, and immunoreactivity of the expressed 
products with HDV antisera was carried out as described in §D.3.a., and was simultaneous with the analysis 
of the expressed products from pSOD-orf 1 , pSOD-orf2, and pSOD-orf6, and pSOD-orf7. As seen in Figure 
8, which shows an immunoblot, only the 0RF5 encoded polypeptides bound to HDV antisera, and these 
polypeptides did not bind to antisera from uninfected individuals. 

io For the immunoblot analysis in Figure 8, bacterial cultures harboring control plasmid (pSOD16cf2) or 
hSOD-orfl. 2, 6, 7 and ORF5 expression piasmids were induced wrth 1PTG for approximately four hours. 
Cells were pelleted, lipid and protein from 0.024D equivalent of cells were electrophoresed on 12% 
Laemmli gels as described in § D.3. Protein was transferred onto nitrocellulose filters in carbonate buffer. 
Immunoblots were incubated with a 1:200 dilution of human HDV antiserum followed by incubation with 125 l- 

15 labeled sheep antihuman IgG antibody, and washed as described in §D.3.a. Lysates appear in the following 
order: lane 1, pSOD16cf2; lane 2, pSOD-orfl; lane 3, pSOD-orf2; lane 4, port5; lane 5, pSOD-orf6; and lane 
6, pSOD-orf7. 

Figure 9 also shows that the products expressed from porf5 which react with HDV specific antibodies 
are of two molecular weight species, approximately 27k and 24 k. As described below, these polypeptides 
20 contain immunogenic epitopes shared by both hepatitis viral polypeptides p27 5 and p24 s . The presence of 
27kd and 24kd poly peptides in HDV has been recently reported. Bergmann, K.F., and Gerin, J.L, J of Inf 
Diseases (1986) 154 :702; and Bonino, F., et al, J Virol (1986) 58:945. In addition, as shown below, these 
polypeptides also probably comprise the hepatitis delta antigen (HDAg). HDAg was originally found in the 
nuclei of hepatocytes of infected individuals. Rizzetto, M., et al, Gut (1977) 18: 997. 

25 

D.3.d. The Expression in Yeast of HDV Polypeptides Encoded in ORF5, and Partial Purification of the 
Product 

30 A yeast expression vector was constructed which directed the synthesis of required ORF5 encoded 
HDV polypeptides. Expression of this plasmid, pYAG-5p1, in yeast strain AB 110 yielded a 195 amino acid 
polypeptide which is immunologically reactive with HDV antiserum, and which is putative ly viral protein 
p245. 

The yeast expression vector, pYAG-5p1 , was constructed as follows: First, pAG-5p1 was constructed by 
35 inserting ORF5 from clone 5115 ligated to a new linker, into an expression cassette in PBS100. The 

cassette, which can be expressed with Bam HI contains an ADH1-GAP regulatable promoter upstream of the 

unique Ncol site and a GAP terminator downstream of a unique Sai l site. After cloning pAG^pl in coli 

HB101, the ORF5 containing expression cassette was restricted from pAG-5p1 with Bam HI, and ligated into 

the yeast shuttle, vector pAB24, which had been restricted with Bam HI. The resulting piasmids were cloned 
40 in E col[ HB101, and a shuttle plasmid, pYAG-5p1, was selected; for expression of ORF5, yeast strain 

AB110 was transformed with this plasmid to yield AB110(pYAG-5p1). 

A sample of yeast strain AB1l0(pYAG-5p1) has been depositecPwith the ATCC, 12301 Parklawn Drive, 

Rockville, Maryland 20852, and has been assigned Accession No. . This deposit will be maintained 

under the conditions specified in the Budapest Treaty. 
45 More specifically, the ORF5 containing expression cassette was constructed by ligating the following: a 

gel-purified 605 b.p. fragment obtained by digesting clone 5115 with Sstll and Sail; a new linker (linker 3); 

and a gel-purified 5841 b.p. fragment obtained by digesting PBS1 00 with Nco l and Sail. The sequence of 

linker 3 was: 

so 

5 ' CATG AGC CGG TCC GAG TCG AGG AAG AAC CGC 3 ' 
TGC GCC AGG CTC AGC TCC TTC TTG G 

55 
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The plasmid PBS100 is described in U.S. patent application serial No. 760,197, which is assigned to the 
herein assignee, and which is hereby incorporated by reference. This plasmid contains a yeast expression 
cassette cloned into pAB12, a pBR322 derivative. The expression cassette contains a hybrid ADH2-GAP 
promoter, a GAP terminator, and nonessential sequences between the Nco I and Sail sites; these latter 

5 sequences were replaced with the ORF5 region from clone 5115. The ADH2-GAP promoter is a 1200 bp 
Bam HI-Ncol fragment isolated from plasmid pJS103. 

Plasmid pJS103 was constructed as follows: The ADH2 portion of the promoter was constructed by 
cutting a plasmid containing the wild-type ADH2 gene from plasmid pADR2 (Beier et al. Nature (1982) 
300:724-728) with restriction enzyme EcoRV, which cuts at position +66 relative to the ATG start codon, as 

to well as in two other sites in pADR2, outside of the ADH2 region. The resulting mixture of a vector fragment 
and two smaller fragments was resected with Bal31 exonuclease to remove about 300 bp. Synthetic Xho l 
linkers were ligated onto the Bal31 -treated DNA. The resulting DNA linker vector fragment (about 5 kb) was 
separated from the linkers by column chromatography, cut with restriction enzyme Xho l, religated, and used 
to transform E. coli to ampicillin resistance. The positions of the Xho l linker were determined by DNA 

75 sequencing. One plasmid which contained an Xho l linker within the 5' nontranscribed region of the ADH2 
gene (position -232 from ATG) was cut with the restriction enzyme Xho l , treated with nuclease S1 , and 
subsequently treated with the restriction enzyme Eco RI to create a linear vector molecule having one blunt 
end at the site of the Xho I linker and an Eco RI end. The GAP portion of the promoter was constructed by 
cutting plasmid pPGAPI with the enzymes Bam HI and EcoRI, followed by the isolation of the 0.4 Kbp DNA 

20 fragment. This purified fragment was then completely digested with the enzyme AM and an approximately 
200 bp fragment was isolated. This GAP promoter fragment was ligated to the ADH2 fragment present on 
the linear vector described above to give plasmid pJS103. 

The plasmid pPGAPI is a yeast expression cassette vector which has a poly restriction site linker 
between the GAPDH terminator and a truncated GAPDH promoter region. The poly restriction site contains 

25 the recognition sites for Nco l, Eco RI, and Sail, and the cassette is excisable as a Bam HI fragment. The 
preparation of pPGAPI is described in EPO 0 164 556 and Travis, J., et al, J Biol Chem (1985) 260(7)- 
:4384-4389. In both references pPGAPI is referred to pPGAP. 

Plasmid pAB12 is a pBR322 derivative which lacks the region between the unique Hindlll and Sal I 
sites, and contains a Bam HI linker in the unique Eco RI site. The vector was constructed by digesting 

30 pBR322 to completion with Hindlll and SaM, followed by limited digestion with Ball nuclease. The resulting 
ends were eluated with Klenow and the blunted ends ligated with T4DNA ligase to reform closed covalent 
circles. These circles were then digested to completion with Eco RI, the overhangs filled in with Klenow, and 
the blunt ends were ligated with Bam HI linkers. Excess linkers were removed by digestion with BamHI, and 
covalently closed circles were formed by ligation. 

35 Plasmid pAB24 is a yeast shuttle vector which contains the complete 2u sequence (Broach, in 
Molecular Biology of the Yeast Saccharomyces , 2:445, Cold Spring Harbor Press (1981)) and pBR322 
sequences, ft also contains the yeast URA3 gene derived from plasmid YEp24 (Botstein, et al, (1979) Gene 
8:17) and the yeast LEU 2 ^ gene derived from plasmid pCl/1 (described in European Patent Application 
publication No. EPO 0 116 201). Plasmid pAB24 was constructed by digesting YEp24 with EcoRI and 

40 religating the vector to remove the partial 2u sequences. The resulting plasmid, YEp24ARI, was linearized 
by digestion with Clal and ligated with the complete 2u plasmid which had been linearized with Clal. The 
resulting plasmid, pCBou, was then digested with Xbal and the 8§BS**bp vector fragment was gel isolated. 
This isolated Xba l fragment was ligated with a 4460 bp Xba l fragment containing the LEU^ gene isolated 
from pCl/1; the orientation of the LEU 5 ^ gene is in the same direction as the URA3 gene. Insertion of the 

45 expression cassette was in the unique Bam HI site of the pBR322 sequences, thus interrupting the gene for 
bacterial resistance to tetracycline. Figure 9 presents a map of pAB24. showing the restriction enzyme sites 
and some distinctive features. 

Expression of ORF5 in yeast was accomplished using yeast strain AB110 which had been transformed 
with pYAG-ap1. Yeast strain AB110 has been described in U.S. patent application Serial No. 620,662, which 

so is assigned to the herein assignee, and which is hereby incorporated by reference. The genotype of AB110 
is MATa, ura 3-52, ]eu2-04 or both leu 2-3 and leu 2-112, pej) 4-3, his 4-500 [cir 0 ]. 

For expression, cells from a frozen stock were streaked onto Leu" plates and incubated at 30 °C. A 
single colony was inoculated into Leucine selective media [synthetic minimal media, amino acid supplement 
(w/o Leu), 8% glucose; Sherman et al in Laboratory Manual for Methods in Yeast Genetics , Cold Spring 

55 Harbor Laboratory 1986, pp. 163-169; and the culture was incubated with shaking at 30° C. 2 ml of the 
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culture were then inoculated into 100 mis Leu~ Media, 3% glucose and incubated with shaking at 30 °C. 
When the culture had reached saturation, 50 mis was inoculated into 11 of Leu~ media, 1% glucose. The 
culture was incubated with shaking at 30°C until the density was measured at ODe5o = 1-75 OD/ml at which 
point the cells were pelleted and either stored at -80° C or processed for protein purification. 

5 Orf 5 encoded proteins expressed in yeast were partially purified as follows. The yeast expression 
culture, AB110(pYAG-5p1), was pelleted and the volume of packed cells was estimated. The 5p1 protein 
was purified using the glass bead lysis method. The cell pellet was resuspended in 2 volumes (vol.) of 
Buffer I (50 mM Tris-HCI pH 8.0, 1mM EDTA, 1mM phenylmethyl sulfonyl fluoride (PMSF), 1 ug/ml 
pepstatin A) and 1 vol. of glass beads (0J25 mm, acid and heat treated). The cells were lysed by vigorous 

10 vortexing and kept at 4°C. The suspension was centrifuged, the supernatant was removed, and the eel! 
pellet was washed in 2 vol. Buffer 1,1% Triton X-1 00 and then centrifuged. The supernatant was removed 
and the pellet was washed 3 times with 2 vol. of Buffer I; during the last wash the glass beads were 
removed from the protein suspension. The washed pellet was extracted 2 times with equal volumes of 
Buffer I, 6M Urea to solubilize the protein. The supernatants were combined and diluted 1:10 with Buffer I 

75 and stored at 4°C with 20 mM sodium azide as a preservative. The final step before use of the protein was 
to dialyze twice in 100-300 vol. of Buffer I without PMSF and pepstatin. 



DA Identification of Polypeptides Encoded Within ORF5 

20 

The polypeptides encoded within ORF5 were identified as those of p27* and p24 5 , by direct 
comparison of the sizes expressed in bacteria recombinant unfused polypeptides with that of the pZT 6 and 
p24 5 present in HDV particles and in HDAg-positive liver extracts. The ORF5 encoded polypeptides were 
further identified as p27^ and p24 5 on the basis of immunological competition between the recombinant 
25 polypeptides expressed in yeast and p27* and p24 5 for HDV antibodies. Finally, the OFtFS-encoded 
polypeptides were identified as components of nuclear HDAg by the competitive binding of the recombinant 
polypeptides with the nuclear HDAg, as monitored by indirect immunoperoxidase staining of HDV-infected 
liver slices. 

30 

D.4.a. Comparison of Anti-HDV Antibody Binding Polypeptides Expressed in Bacteria from porfS with 
p24 5 and p27* in HDV Particles and in HDV Infected Liver 

The expression of the ORF5-encoded polypeptides from ports in bacteria, and the preparation of the 

35 lysates were as described above in § D.3. Lysates in HDV particles and of HDV infected liver were kindly 
prepared by K.F. Bergmann according to the procedure described by K.F. Bergmann and J.L Geria, J of Inf 
Diseases (1986) 154:702. which is hereby incorporated by reference. In the preparation of liver lysates, liver 
samples were minced with scissors and washed with PBS followed by homogenization with a Potter- 
Elvejem Apparatus in 6 M guanidinium HCI (pH 6). After 1-3 hr of incubation at 4°C. the extracts were 

40 centrifuged at 1500 g for 10 min, diaiyzed against PNS, and centrifuged again. Viral lysates were prepared 
by modifying the reported procedure to omit the BSA. Serum samples were layered over 20% sucrose, 
0.02 M HEPES (pH 7.4), 0.01 M CaCI2, 0.01 M MgCir, and were*Writrifuged in an SW 41 rotor for 5 hr at 
150,000 g to pellet the virus. The pellets were held in 0.05 M Tris (pH 6.8) and 2% SDS. 

Lysates of bacteria expressing ORF5 were eiectrophoresed in lanes adjacent to extracts of pelleted 

45 HDV or extracts of HDAg positive liver on 12% Laemmli gels, immunoblotted on nitrocellulose and 
incubated with HDV antiserum (1:400 dilution). Figure 10A shows the immunoblot of extracts of HDV virus 
(lane 1 ), of infected liver lysate (lane 2), and a lysate of bacteria after expression of porfS (lane 3). As seen 
in Figure 10A, two major immunoreactive polypeptides in the bacterial lysates appeared to comigrate with 
the p27* and p24 5 polypeptides extracted from pelleted virus and from HDAg-positive liver. Several low 

so molecular weight immunoreactive polypeptides were also present in the bacterial lysate; these may 
represent proteolytic products of p27 and/or p24. 
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D.4.b. Immunological Competition Between ORF5 Polypeptide Expressed in Yeast or Bacteria and p24 a and 
p27 5 in HDV Particles or in HDV Infected Liver 

Immunological competition between recombinant ORF5 products and viral peptides, p24 5 and p27*, 

5 was determined by competitive binding assays. In general, the HDV antiserum was allowed to absorb to the 
recombinant 0RF5 products produced either in yeast or in bacteria. The preabsorbed serum was compared 
to control serum as to its ability to bind to viral p24 5 and p27* in an immunoblotting procedure. The control 
serum was HDV antiserum which had been preabsorbed with the expression products of yeast or bacterial 
cultures transformed with the control (parental) vectors. The expression conditions were those which 

w allowed expression of the recombinant vectors containing ORF5. 

Immunological competition between ORF5 products expressed in yeast strain AB110 transformed with 
pYAG- 5 p1 and p24 5 and p27^ in HDV particles was determined as follows. The recombinant ORF5 products 
were expressed in yeast and partially purified under the conditions described in §D.3. Extracts of HDV 
particles were prepared, read on Laemmli gels, and blotted as described in §D.4.a. ( except that in the 

is immunoblotting procedure 5% nonfat milk in 1 x PBS (0.14 M NaCI, 2.5 mM KCl, 1.5 mM K H2PO4, 8 mM 
Na2H POa. 12 H20, pH = 7.4) was used as a blocking agent prior to incubation of nitrocellulose filters with 
HDV antiserum. The blotted HDV polypeptides were incubated with HDV antiserum which had been 
preabsorbed with extracts from 0.44 ml of yeast culture (OD 650, 16 OD/ml) expressing either: (1) the 
parental control plasmid, pAB 24; or (2) the ORF5 containing plasmid, pYAG-5p1 . 

20 Figure 10B presents the immunoblots using HDV antiserum preabsorbed with lysates of yeast 
expressing either the control plasmid (lane 1); or the ORF5 containing plasmid (lane 2). As seen in the 
figure, preabsorption of the HDV antiserum with the recombinant ORF5 polypeptides completely eliminated 
antibody binding to HDV polypeptides p24 5 and p27*; preabsorption with ADV antiserum preabsorbed to 
the control lysate did not prevent the binding. The weak, diffuse band in Figure 10B, iane 2, may represent 

25 nonspecific binding, since it was also present when control sera lacking HDV antibodies replaced HDV 
antiserum. From the binding in common of the polyclonal antibodies in the HDV antiserum, it may be 
deduced that the ORF polypeptides were immunologically identifiable as p24 5 and P27 5 . 

Immunological competition between ORF5 polypeptides expressed in bacteria or yeast and p24 5 and 
D27* in infected liver was also determined. The expression of ORF5 polypeptides in yeast strain HB110 

30 (pYAG-6p1) was as described immediately above; expression of ORF5 polypeptides in E coli D1210 
transformed with porfS was under the conditions described in §D.3. Liver lysates were prepared and blotted 
as described in §D.4.a., except that the above described modification in the blotting procedure was also 
used. The blots of the polypeptides in HDAg positive liver extracts were incubated with HDV antiserum 
which had been preincubated with the following: extracts of yeast cultures AB110 expressing pAB24 

35 (control); extracts of yeast cultures AB110 expressing ORF 5 from pYAG-5p1; extracts of E. coli D1210 
expressing pSOD16cf2 (control); and extracts of E. coli expressing porf5. Preabsorption of the HDV 
antiserum with: (1) the yeast cultures was with 0.44 ml of ODeo, 16 OD/ml; and (2) E. coliwas with 
approximately 100 ml of O.D^m 0.6 OD/al. The blots were incubated with 1:1000 dilutions of the 
preabsorbed HDV antiserum. In addition, as a control, a blot was incubated with HDV antiserum which was 

40 preincubated with an equal volume of dialyzed urea extraction buffer. 

Figure 10C presents the immunoblots of HDV polypeptidesjising the" serum preabsorbed with: yeast 
expressing the control plasmid (lane 1); yeast expressing ORF5 (lane 2); E. coli expressing the control 
plasmid Qane 3); E. coli expressing ORF5 (lane 4); and buffer control (lane 5). As seen in the figure, 
preabsorption of HDV antiserum with ORF5 polypeptides expressed in yeast completely eliminated the 

45 binding of HDV specific antibodies to p27* and p24 5 in HDAg-positive liver extracts. Orf 5 polypeptides 
from bacterial cultures also appeared to eliminate the binding of HDV specific antibodies to p2r, and 
reduced the binding of these antibodies to p24 5 by at least 10 fold based on densitometry tracings of the 
original autoradiograms. The residual binding of HDV antiserum to p24* is probably due to the limiting 
amount of the ORF5 polypeptide in bacterial extracts. In none of the controls was there a significant 

50 reduction in the binding of HDV antigen to p24 5 and p27* from HDAg infected liver. 
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D.4.c. Immunological Competition Between ORF 5 Encoded Polypeptides and Nuclear HDAg in HDV 
Infected Liver 

Section of HDAg positive liver were incubated with HDV antiserum which had been preabsorbed with 

5 ORF5 encoded polypeptides expressed in yeast, or with a control lysate. The preparation of the preabsor- 
bed HDV antiserum, including controls, was as described in § D.3.B. The sections were subsequently 
incubated with a peroxidase labeled antihurrtan IgG, and indirect immunoperoxidase staining was per- 
formed. The procedures were according to Govindarajan. S., et ai, Histopathology (1984) 8:63, which is 
hereby incorporated by reference. In this method, a preliminary blocking of endogenous peroxidase was 

10 carried out. Deplasticized sections were incubated with preabsorbed HDV antiserum for 30 min. In a moist 
chamber at RT. followed by 2 washes with PBS and treatment with horseradish peroxidase conjugated 
rabbit antihuman IgG in a moist chamber. Subsequently, the sections were rinsed in PBS for 10 min. and 
treated for 5-8 min. with 3-3' diamino-benzidine hydrochloride and hydrogen peroxide. After dehydration the 
sections were cover slipped and examined by light microscopy. 

75 Figure 1 1 shows photographs of the stained liver sections. The photographs were taken at a magnifica- 
tion of 150x. Figure 11A shows the indirect immunoperoxidase staining obtained with HDV antiserum 
incubated with the control yeast lysate. Figure 11B shows the staining when the HDV antiserum is 
preabsorbed with ORF5 encoded polypeptides expressed in yeast. In contrast to the clear binding to nuclei 
of antibodies against HDAg observed in HDV antiserum preabsorbed with the control extract, there was no 

20 binding when the antiserum was preabsorbed with the recombinant ORF5 polypeptides. 

Heretofore direct evidence that p24 5 and p27* are components of nuclear delta antigen has been 
lacking. The data provided above indicates that the ORF5 encoded products complete with the nuclear 
delta antigen for HDV specific antibodies. The data also show that ORF5 encodes 2 polypeptides which are 
the same size as p24 5 and pZl 6 , and which have the same immunoreactive epitopes as those viral 

25 polypeptides. Hence, the combined data show that ORF encodes viral polypeptides p24 5 and P27 5 , and that 
these polypeptides are components of nuclear delta antigen in HDV infected liver. 



D.5. Hybrid Particle HDV Immunogens 

30 

U.S. Serial No. 650,323, filed 12 April 1984 and assigned to the same assignee is incorporated herein 
by reference. This application describes the construction of hybrid particles of hepatitis B surface antigen 
(HBsAg) containing inserts of foreign immunogens into a presurface (pre-S) coding portion in reading frame 
with the codons for HBsAg. Plasmid pDClOl, described therein contains a portion of the pre-S/HBsAg 

35 gene, including 55 codons of the pre-S region, in a GAPDH controlled expression cassette cloned into the 
Bam HI site of a pBR322 derivative. The incorporated application describes the insertion of desired 
immunogens, such as the gD (glycoprotein D) antigenic site into a unique Eco RI site present in the pre-S 
region of pDClOl to give the hybrid plasmid pDC103. Similarly, in accordance with the present invention, 
desired epitopes derived from the HDV genome, particularly those encoded in ORF5, may be provided with 

40 suitable Eco RI linkers and inserted in proper reading frame into the Eco RI site of pDC101 f or used to 
replace the gD codons in the pDC103 hybrid. pDC103 is deposited with ATCC and has Accession No. 
20726. 

Hybrid particle immunogens are thus prepared using fused coding sequences for HDsAg and HDV and 
provide enhanced immunogenicity for the HDV epitopes. 

45 

D.6. Production of Antibodies to ORF5 Encoded Polypeptides 

Antibodies to 0RF5 encoded polypeptides are produced by immunizing an animal with partially purified 
so ORF5 encoded polypeptides expressed in yeast strain AB1 10(pYAG-5pl). The expression conditions and 
partial purification procedures for the yeast ORF5 products are those described supra. The polyclonal 
antibodies thereby derived may be purified from those directed against ORF5 encoded polypeptides by 
affinity chromatography, i.e., by passing the antiserum through affinity columns containing the expression 
products of the parental plasmid, pAB24. The antibodies to ORF5 products should be in the effluent The 
55 techniques for preparing affinity columns are known to those of average skill in the art. 
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Utility 

The invention disclosed herein has the following industrial uses. THe information on the nucleotide 
sequence of the HDV genome may be used to design nucleotide probes which are useful for the diagnosis 

5 of HDV infection; these probes may also be used in diagnostic kits. The nucleotide sequence information 
may also be used to synthesize peptides and polypeptides which, in turn have the following uses. The 
peptides and polypeptides synthesized from ORF5 sequences, in particular, are useful for diagnosing HDV 
infections as reflected by the presence of HDV antibodies, since ORF5 encodes the polypeptides 
comprising the HDV 6 antigen. In addition, the products of expression of ORF5 sequences are useful in the 

io production of vaccines to HDV, and in the preparation of HDV antibodies, both polyclonal and monoclonal. 
HDV antibodies directed against the ORF5 products may be used for the diagnosis of HDV antigens, based 
upon the presence of the antigens themselves, these antibodies may form the basis of diagnostic kits for 
HDV. In addition, the antibodies may also be used in vaccines against HDV. 

The peptides or polypeptides synthesized from other ORF sequences may also be used to raise 

75 antibodies against HDV encoded components. These antibodies, as well as the ORF sequence products, 
may be useful in determining the viral replicative cycle and the cellular interactions with the viral 
components. This knowledge, in turn, will be useful for the commercial development of vaccines against 
HDV. 

20 

Claims 

I. A nucleotide sequence useful for the production of hepatitis D virus (HDV) diagnostics or vaccines 
derived from the HDV genome or its complement as represented in Figure 2. 

25 2. A nucleotide sequence useful for the production of HDV diagnostics or vaccines derived from the 
HDV genome or its complement as represented in Figure 2, wherein the sequence comprises a region 
derived from open reading frames (ORFs) 1-12, and preferably is derived from ORF1, or ORF2, or 0RF5, or 
ORF6, or ORF7, and more preferably is derived from ORFS. 

3. A recombinant expression system comprising a coding portion of DNA derived from the sequence of 
30 claim 2 operably linked to a control sequence compatible with a desired host. 

4. Host cells transformed with the expression system of claim 3. 

5. Protein produced by the cells of claim 4. 

6. Antibodies raised by the protein of claim 5. 

7. An immunogenic particle which raises antibodies against HDV antigens, said particle comprising a 
35 recombinant polypeptide having an amino acid sequence capable of forming a particle when said sequence 

is produced in a eukaryotic host, wherein said polypeptide contains an epitope of HDV. 

8. An oligonucleotide probe useful for detecting the presence of HDV in a biological sample which 
comprises a DNA sequence of 8 or more nucleotides derived from the HDV genomic sequence or its 
complement, as represented in Figure 9. 

ao 9. A kit for analyzing biological samples for the presence of HDV which comprises the oligonucleotide 
probe of claim 8 and instructions for said analysis. 

10. A kit for analyzing the presence of HDV antigens in biological samples comprising antibodies raised 
to the protein of claim 5 or the particle of claim 7, and instructions for said analysis. 

II. A kit for analyzing the presence of HDV antibodies in biological samples comprising a polypeptide 
45 containing an epitope encoded within the nucleotide sequence of ciaim 2. 

12. A vaccine for treating or preventing HDV infection comprising a polypeptide which is encoded in the 
nucleotide sequence of claim 2. 

13. A method for producing a protein which contains an epitope which is immunologically identifiable as 
an epitope contained within an HDV antigen, comprising incubating host cells transformed with a recom- 

50 binant vector under conditions which allow expression of the vector, wherein said vector comprises a 
nucleotide sequence derived from the HDV genome or its complement as represented in Figure 2, said 
sequence optionally being as further defined in Claim 2. 

1 4. A method of detecting HDV comprising hybridizing with a biological sample a nucleotide sequence 
derived from the HDV genome or its complement as represented in Figure 2, said sequence optionally 

55 being as further defined in Claim 2. 
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15. A method for producing antibodies to HDV comprising vaccinating an animal with a protein which 
contains an epitope which is immunologically identifiable as an epitope contained within an HDV antigen, 
wherein the protein is synthesized in host cells transformed with a recombinant vector which comprises a 
nucleotide sequence derived from the HDV genome or its complement as represented in Figure 2, said 
sequence optionally being as further defined in Claim 2. 

16. The use of a nucleotide sequence derived from the HDV genome or its complement as represented 
in Figure 2, said sequence optionally being as further defined in Claim 2, in the preparation of a vaccine 
against HDV or in the preparation of a kit for the detection of material characteristic of HDV. 
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GlyLeuProProLeuAlaGluMetSerArgSerGluSerArgLysAsnArgGlyGlyArg 
GGACTGCCGCCTCTAGCCGAGATGAGCCGGTCCGAGTCGAGGAAGAACCGCGGAGGGAGA 

G 



GluGluIleLeuGluGlnTrpValAlaGlyArgLysLysLeuGluGluLeuGluArgAsp 
6 1 GAAGAGATCCTCGAGCAGTGGGTGGCCGGAAGAAAGAAGTTAGAGG AACTCGAG AGAGAC 



LeuArgLysThrLysLysLysLeuLysLysIleGluAspGluAsnProTrpIieuGlyAsn 
121 CTCCGGAAGACAAAGAAGAAACTCAAAAAGATTGAGGACGAAAATCCCTGGCTGGGGAAC 

G 

IleLysGlylleLeuGlyLysLysAspLysAspGlyGluGlyAlaProProAlaLysArg 
181 ATCAAAGGAATTCTCGGAAAGAAGGATAAGGATGGAGAGGGGGCTCCCCCGGCGAAGAGG 

C 

AlaArgThrAspGlnMetGluValAspSerGlyProArgLysArgProLeuArgGlyGly 
241 G CCCGAACGGACCAGATGGAGG TAG ACTCCGGACCTAGGAAGAGGCCTCTCAGGGGAGGA 



Arg 

PheThrAspLysGluArgGlnAspHisArgArgArgLysAlaLeuGluAsnLysLysLys 
301 TT(^CCGA(^GGAGAGGCAGGAT(^CCGATOAAGGAAGGCCCTCGAGAACAAGAAGAAG 
G 

GlnLeuSerAlaGlyGlyLysAsriLeuSerLysGluGluGluGluGluIieuArgArgLeu 
361 CAGCTATCGGCGGG AGGCAAGAACCTCAGCAAGG AGG AAGAAGAGG AACTCAGG AGGTTG 



Asn 

Thr<51uGluAspGluArgArgGliiArgArgValAlaGlyProProValGlyGlyValIle 
421 ACCGAGGAAGACGAGAGAAGGGAAAGAAGAGTAGCCGGCCCGCCGGTTGGGGGTGTGATC 

A 

^ Leu 
ProLeuGluGlyGiySerXrgGlyMaftoGlyGlyGlyPheValProSerLeuGlnGly 
481 CCCCTCGAAGGTGGATCGAGGGGAGCGCCCGGGGGCGGCTTCGTCCCCAGTCTGCAGGGA 

T 

Gin 

ValProGluSexProPheSerArgThrGlyGluGlyl^uAspIleArgGlyAsnArgGly 
54 1 GTCCCGGAGTCCCCCTTCTCTCGGACCGGGGAGGGGCTGGACATCAGGGGAAACCGGGGA 

A 

ENID 

PheProTipAspIleLeuPheProAlaAspProProPheSerProGlnSerCysArgPro 
601 TTTCCATGGGATATACTCTTCCCAGCCGATCCGCCCTOT 
A A 
* 



GlnEND 

661 CAGTGA p j q 
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1 GTCCCCCTTCTCT CGGACCGGGGAGGGGCTGGACATCAGGGGAAACCGGGGATTTCCATG 
CAGGGGGAAGAGAGCCTGGCCCCTCCCCGACCTGTAGT CCCCTTTGGCC CCTAAAGGTAC 

15 AVA2 5AU96, 18 HPA11 NCI1 SCRFl, 23 MNL1 , 46 HPA11 NCI1 S 
CRF1, 56 NCOl , 57 NLA i ll , 

61 GGATATACTCTTCCCAGCCGATCCACCCTTTTCTCCCCAGAGTTGTCGACCCCAGTG AAT 
CCTATATGAGAAGGGTCGGCTAGGTGGGAAAAGAGGGGTCT CAA CAG CTGGGGT CA CTTA 

69 MBOll, 80 BIN1 MBOl, 105 ACC1 H1NC11 SAL1 , 106 TAQ1 , 

121 AAAGCGGGTTTCCACTCACAGGTTTGCGTCTC6CGTCCTTCTTTCCT CTTCGGGT CGGCA 
TTTCGCCCAAAGGTGAGTGTCCAAACGCAGAGCGCAGGAAGAAAGGAGAAGCCCAGCCGT 

146 HGA1, 152 THAI, 153 HGA1 , 165 NNL1 , 167 MBOll, 179 ML Al 1 

181 TGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGCACGTCCACTC 
ACCGTAGAGGTGGAGGAGCGCCAGGCTGGACCCGTAGGCTTCCTCCTGCGTGCAGGTGAG 

183 SFANl, 192 MNL1 , 195 MNL1 , 198 THAI, 201 AVA2 SAU96, 208 

EC0R11 SCRFl, 213 SFANl, 214 F0K1 , 223 MNL1 , 226 HGA1 , 231 
HAE2, 235 BSTX1 , 

241 6GATG6CTAAGGGAGAGCCACTTTTCTCTCGATTCTCTATCGGAATCTAGAGAGATTTGT 
CCTACCGATTCCCTCTCGGTGAAAAGAGAGCTAAGAGATAGCCTTAGATCTCTCTAAACA 

241 F0K1, 247 DDE1 , 269 TAQ1 , 271 HINF1 , 283 HINF1 , 286 XBA1 
# 287 MAE1 , 

301 GGGTCCCATTCGCCATTACCGAGGGGACGGTCCCCTCGGAATGTTGCCCAGCC6GC6CCA 
CCCAGGGTAAGCGGT AAT GGCTCCCCTGCCAGGGGAGC CTTA CAA CGGGTCGG.CCGCGGT 

301 NLA1V, 302 AVA2 NLA1V SAU96, 321 MNL1 , 329 AVA2 NLA1V SA 
U96. 334 MNL1 , 346 BGL1 A 351 NAE1 , 352 HPA11, 354 AHA1 1 BAN1 
HAE.11 NAR1 NLA1V, 355 HHA1 , 

361 GCGAGGAGGCTGGGACCATGCCGGCCATCAGGTAAGAAA.GGATGGAACGCGGACCCTGCA 
CGCTCCTCCGACCCTGGTACGGCCGGTAGTCCATTCTTTCCTACCTTGCGCCTGG6AC6T 

363 MNL1, 366 MNL1 , 372 NLA1V, 373 AVA2 SAU96, 377 NLA111, 3 
BO NAE1 , 381 HPA11, 382 CFRi, 383 HAE111, 400 F0K1 , 408 THAI 
, 411 AVA2 NLA1V SAU96 , 416 PST1 , 

421 GAGTGGGGTCCCGCCATTCCTGGGCGACCCTTGGGGGGGGGA6TCGGAATCGAGCATCGG 
CTCACCCCAGGGCGGTAAGGACCCGCTGGGAACCCCCCCCCTCAGCCTTAGCTCGTAGCC 

426 NLA1V, 427 AVA2 NLA1V SAU96, 43? EC0R11 SCRFl, 461 HINF1 
, 467 HINF1, 470 TAQ1 , 474 SFANl , 

481 GAGGGGCATCCCATGGCTCCACTGGTCCCCGGTGTTCCTAGCACCCCCTCCGGTCACTTT 
CTCCCCGTAG6GTACCGAGGTGACCAGGGGCCACAAGGATC6TGGGGGAGGCCAGTGAAA 

481 MNL1, 486 SFANl, 487 F0K1 , 491 NCOl, 492 NlAIH, 475 NLA 
IV, 504 AVA2 NLA1V SAU96, 508 NC11 SCRFl, 509 HPA11, 518 MAE 
' 523 HG1E11, 527 MNL1 , 530 HPA11, 533 MAE3 , 539 A5U11, 540 



1, 523 
TAQ1, 



541 C6AAGGGGGTCCGGGGTCCCGCTAGAT 
(3CTTCCCCCAGGCCCCAGGGCGATCTA 



601 



547 NLA1V, 548 AVA2 BAU96 ,551 HPA11 NCI1 SCRFl, 554 NLA1V, 
555 AVA2 NLA1V SAU96, 562 MAE1 , 
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